Rank | Count | Beginning |
---|---|---|
2753 | 998 | Die |
1659 | 429 | Das |
2279 | 420 | Der |
7883 | 290 | Sie |
6069 | 289 | In |
4828 | 237 | Es |
849 | 188 | Bei |
3521 | 175 | Diese |
5879 | 174 | Im |
4640 | 134 | Er |
485 | 131 | Auch |
4309 | 127 | Eine |
4278 | 126 | Ein |
9334 | 121 | Wenn |
8658 | 108 | Und |
7035 | 95 | Mit |
8204 | 92 | So |
9620 | 91 | Wir |
5724 | 89 | Ich |
5215 | 88 | Für |
1325 | 80 | Dabei |
191 | 79 | Als |
4156 | 75 | Durch |
7180 | 71 | Nach |
280 | 64 | Am |
1319 | 62 | Da |
9547 | 61 | Wie |
617 | 57 | Auf |
9456 | 55 | Wer |
9 | 54 | Aber |
The next four subsections show the most frequent sentence beginnings consisting of N words, for N = 1, 2, 3, 4. This subsection starts with N = 1.
The most frequent word-N-grams at the beginning of sentences give some insight into sentence composition.
For N = 1 in particular, even a small corpus is enough to identify the most frequent sentence beginnings.
SELECT substring_index(sentence, ' ', 1) AS beg, count(*) AS cnt
FROM sentences
GROUP BY substring_index(sentence, ' ', 1)
ORDER BY cnt DESC
LIMIT 50;
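The same count can be sketched outside the database, which also makes the generalization to N = 2, 3, 4 explicit. The following Python is an illustration only, not the code used to produce the table above: the function name `sentence_beginnings`, its parameters, and the sample sentences are hypothetical, and it assumes sentences are available as plain strings with words separated by single spaces.

```python
from collections import Counter


def sentence_beginnings(sentences, n=1, limit=50):
    """Return the `limit` most frequent beginnings made of the first n words.

    Hypothetical helper: mirrors the query's substring_index(sentence, ' ', n)
    by splitting on single spaces and keeping the first n tokens.
    """
    counts = Counter()
    for sentence in sentences:
        tokens = sentence.split(" ")
        # As with substring_index, a sentence with fewer than n tokens
        # contributes its full text as the "beginning".
        counts[" ".join(tokens[:n])] += 1
    # Sorted by descending count, like ORDER BY cnt DESC LIMIT 50.
    return counts.most_common(limit)


# Made-up sample sentences, for illustration only.
sample = [
    "Die Sonne scheint.",
    "Die Katze schläft.",
    "Das Haus ist alt.",
]

top_one_word = sentence_beginnings(sample, n=1)
top_two_words = sentence_beginnings(sample, n=2)
```

Calling the function with `n=2`, `n=3`, or `n=4` corresponds to changing the third argument of `substring_index` in the query.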
4.3.1.2 Most Frequent Sentence Beginnings II
4.3.1.3 Most Frequent Sentence Beginnings III
4.3.1.4 Most Frequent Sentence Beginnings IV
4.3.1.1 Most Frequent Sentence Endings I
4.3.1.2 Most Frequent Sentence Endings II
4.3.1.3 Most Frequent Sentence Endings III
4.3.1.4 Most Frequent Sentence Endings IV